Vision Mcp Server | 图片分析 Mcp

Created By

Markusbetter8 months ago

This MCP addresses the visual recognition limitations of text-based models by enabling accurate image description and identification, making it excellent for AI-assisted reference design interface analysis. It currently supports dropping links into the dialog box or placing images in the project folder for recognition. The tool can be integrated with MCP platforms like Claude Code, Cline, and Trae. Beyond programming applications, it also provides visual recognition capabilities for models that lack native image processing functionality. For visual models, users can select their preferred model from ModelScope community and replace it during MCP configuration setup. 📱 Daily Use Cases: Send screenshots to directly identify errors or issues Share image links or place screenshots in the project folder for AI-assisted layout optimization Submit product image links to generate promotional copy 该mcp可以解决文字模型图片识别的视觉的问题，可以准确识别描述图片，用来给AI看参考设计界面很nice~ 目前支持丢链接到对话框，以及把图片放到项目文件夹进行识别。支持加入到Claude Code，Cline和Trae等mcp工具中。除了编程外，如果你使用的模型本身不支持视觉图片识别，也可以使用~ 视觉模型可以自己去魔搭社区选一个自己喜欢的，在填写mcp配置的时候替换即可 📱 日常使用场景 - 截图发过去，直接告诉哪里出错了 - 丢过去一个图片链接或者截图放到项目文件夹内，让AI帮忙优化布局 - 发个产品图链接，让AI写推广文案

# vision

# image-analysis

Overview Content Tools Comments

Content

Vision MCP Server | 图片分析 MCP

English | 中文

中文

一个用于图片分析的 MCP (Model Context Protocol) 服务器，支持图片内容分析和描述。例如当你在客户端的模型只支持文字输入，这时你可以使用视觉模型mcp来弥补。这个项目采用了魔搭社区免费的视觉模型Qwen3-VL-30B-A3B-Instruct（你也可以在配置中，使用魔搭社区自行更换为自己想要的视觉模型）。

功能特点

支持本地图片文件和在线图片 URL
基于魔搭社区 AI 模型的智能图像分析
完全兼容 MCP 协议
TypeScript 支持，提供完整的类型定义

安装

方式一：使用 npx（推荐）

无需预先安装，在客户端填写以下内容npx 会自动下载并运行最新版本：

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "npx",
      "args": [
        "-y",
        "vision-mcp-server"
      ],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

方式二：全局安装

npm install -g vision-mcp-server

然后在客户端配置中：

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "vision-mcp-server",
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

方式三：本地安装

npm install vision-mcp-server

然后在客户端配置中：

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "node",
      "args": ["node_modules/vision-mcp-server/dist/index.js"],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

环境变量配置

在使用前，需要设置以下环境变量：

MODELSCOPE_TOKEN: 魔搭社区的 API 密钥（必需）
- 获取方式：访问魔搭社区 → 个人中心 → API令牌
MODELSCOPE_MODEL: 使用的模型名称（可选，默认为 "Qwen/Qwen3-VL-30B-A3B-Instruct"）
- 支持其他视觉模型，如：Qwen/Qwen2-VL-7B-Instruct

使用示例

// 分析本地图片
{
  "name": "analyze_image",
  "arguments": {
    "image": "/path/to/your/image.jpg",
    "prompt": "请描述这张图片的内容"
  }
}

// 分析在线图片
{
  "name": "analyze_image",
  "arguments": {
    "image": "https://example.com/image.jpg",
    "prompt": "这张图片中有哪些物体？"
  }
}

API 参考

analyze_image

分析图片内容并提供详细描述。

参数:

image (string): 图片 URL 或本地文件路径
prompt (string, 可选): 对图片的问题或分析要求，默认为 "请描述这张图片的内容"

返回: 图片内容的详细文本描述。

开发

构建

npm run build

测试

npm test

贡献

欢迎提交 Issue 和 Pull Request！

许可证

MIT

更新日志

1.0.0

初始版本发布
支持图片分析功能
兼容 MCP 协议

English

A Vision Analysis MCP (Model Context Protocol) Server that supports image content analysis and description.

Features

Support for local image files and online image URLs
Intelligent image analysis based on ModelScope AI models
Full compatibility with MCP protocol
TypeScript support with complete type definitions

Installation

Option 1: Using npx (Recommended)

No need to pre-install, npx will automatically download and run the latest version:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "npx",
      "args": [
        "-y",
        "vision-mcp-server"
      ],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

Option 2: Global Installation

npm install -g vision-mcp-server

Then in your client configuration:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "vision-mcp-server",
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

Option 3: Local Installation

npm install vision-mcp-server

Then in your client configuration:

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "node",
      "args": ["node_modules/vision-mcp-server/dist/index.js"],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

Environment Variables Configuration

Before using, you need to set the following environment variables:

MODELSCOPE_TOKEN: ModelScope API key (required)
- Get it from: ModelScope → Profile → API Token
MODELSCOPE_MODEL: Model name to use (optional, default is "Qwen/Qwen3-VL-30B-A3B-Instruct")
- Supports other vision models, such as: Qwen/Qwen2-VL-7B-Instruct

Usage Examples

// Analyze local image
{
  "name": "analyze_image",
  "arguments": {
    "image": "/path/to/your/image.jpg",
    "prompt": "Please describe the content of this image"
  }
}

// Analyze online image
{
  "name": "analyze_image",
  "arguments": {
    "image": "https://example.com/image.jpg",
    "prompt": "What objects are in this image?"
  }
}

API Reference

analyze_image

Analyze image content and provide detailed description.

Parameters:

image (string): Image URL or local file path
prompt (string, optional): Question or analysis requirement for the image, default is "Please describe the content of this image"

Returns: Detailed text description of the image content.

Development

Build

npm run build

Test

npm test

Contributing

Issues and Pull Requests are welcome!

License

MIT

Changelog

1.0.0

Initial release
Image analysis support
MCP protocol compatibility

Server Config

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "npx",
      "args": [
        "-y",
        "vision-mcp-server"
      ],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}

Recommend Servers

TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.

RedisA Model Context Protocol server that provides access to Redis databases. This server enables LLMs to interact with Redis key-value stores through a set of standardized tools.

EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.

WindsurfThe new purpose-built IDE to harness magic

Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code

Y GuiA web-based graphical interface for AI chat interactions with support for multiple AI models and MCP (Model Context Protocol) servers.

Playwright McpPlaywright MCP server

AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.

Amap Maps高德地图官方 MCP Server

Serper MCP ServerA Serper MCP Server

BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.